首页> 外文OA文献 >Bottom-Up Top-Down Cues for Weakly-Supervised Semantic Segmentation
【2h】

Bottom-Up Top-Down Cues for Weakly-Supervised Semantic Segmentation

机译:弱监督语义分割的自下而上自上而下的线索

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We consider the task of learning a classifier for semantic segmentation usingweak supervision in the form of image labels which specify the object classespresent in the image. Our method uses deep convolutional neural networks (CNNs)and adopts an Expectation-Maximization (EM) based approach. We focus on thefollowing three aspects of EM: (i) initialization; (ii) latent posteriorestimation (E-step) and (iii) the parameter update (M-step). We show thatsaliency and attention maps, our bottom-up and top-down cues respectively, ofsimple images provide very good cues to learn an initialization for theEM-based algorithm. Intuitively, we show that before trying to learn to segmentcomplex images, it is much easier and highly effective to first learn tosegment a set of simple images and then move towards the complex ones. Next, inorder to update the parameters, we propose minimizing the combination of thestandard softmax loss and the KL divergence between the true latent posteriorand the likelihood given by the CNN. We argue that this combination is morerobust to wrong predictions made by the expectation step of the EM method. Wesupport this argument with empirical and visual results. Extensive experimentsand discussions show that: (i) our method is very simple and intuitive; (ii)requires only image-level labels; and (iii) consistently outperforms otherweakly-supervised state-of-the-art methods with a very high margin on thePASCAL VOC 2012 dataset.
机译:我们考虑使用弱监督以图像标签的形式学习用于语义分割的分类器的任务,图像标签指定了图像中存在的对象类别。我们的方法使用深度卷积神经网络(CNN),并采用基于期望最大化(EM)的方法。我们集中在以下三个方面:(i)初始化; (ii)潜在后验重估(E步)和(iii)参数更新(M步)。我们显示了简单图像的显着性图和注意力图,分别是我们的自下而上和自上而下的线索,为学习基于EM的算法的初始化提供了很好的线索。直观地表明,在尝试学习对复杂图像进行分割之前,先学习对一组简单图像进行细分然后转向复杂图像是更加容易和高效的。接下来,为了更新参数,我们建议最小化标准softmax损失和真实潜在后验概率(CNN)给出的KL散度的组合。我们认为,这种组合对于由EM方法的预期步骤做出的错误预测更为可靠。我们以经验和视觉结果支持这一论点。大量的实验和讨论表明:(i)我们的方法非常简单直观。 (ii)仅需要图像级标签; (iii)在PASCAL VOC 2012数据集上始终以非常高的优势持续优于其他弱监督的最新方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号